Corpus: jpn_wikipedia_2018_300K

3.13.1 Average Position of Words by Word Length

Average position of words in sentences as a function of word length

word_length count avg(pos) stddev(pos) avg(sentence_length)
1 101575 14.9093 15.1380 2.5023
2 74402 15.4528 15.6419 2.8890
3 2918980 15.9875 14.5200 1.5805
4 46953 13.9732 15.5097 3.2262
5 9899 13.1576 14.5445 5.2428
6 2190978 15.6639 14.6554 1.5470
7 6567 13.1558 14.4321 5.7853
8 4525 13.2228 14.0044 6.0533
9 385294 14.4523 14.2886 1.5317
10 6408 19.6573 15.6442 6.5058
11 5722 21.4893 15.5143 6.4042
12 193118 14.0341 14.1930 1.6756
13 466 14.0730 15.2981 6.2768
14 225 12.6933 15.1076 6.7333
15 55183 12.5147 13.3727 1.5903
16 63 7.5238 9.0796 4.7778
17 56 15.3214 20.1646 5.8393
18 29425 11.6263 12.9133 1.6521
19 58 14.4310 20.1264 5.5690
20 34 9.8529 11.4124 7.6765
21 11567 11.1657 12.5357 1.6985
22 16 12.3125 16.9916 5.9375
23 10 13.3000 22.8081 7.5000
24 4994 11.1135 12.4896 1.7401
25 9 16.1111 13.0592 9.2222
26 9 25.8889 16.6763 15.0000
27 2088 11.9224 13.6710 1.7519
28 4 8.5000 6.5000 5.7500
29 1 25.0000 0.0000 11.0000
30 1066 11.2054 13.1061 1.8405
31 1 0.0000 0.0000 2.0000
32 5 7.2000 5.6356 4.4000
33 586 12.3635 14.2672 1.9710
34 4 13.0000 13.3604 8.7500
35 3 4.3333 4.7842 6.0000
36 350 10.0800 12.2390 1.8029
38 1 2.0000 0.0000 8.0000
39 174 9.7299 11.9603 2.0805
42 115 11.0609 12.3177 1.8435
45 70 12.2000 15.3228 2.2571
48 36 15.8611 17.7015 2.9167
51 22 14.5455 12.7019 3.7273
54 19 16.4737 17.7093 1.0526
57 6 22.3333 15.2169 1.5000
60 7 15.0000 14.0814 3.0000
63 2 0.5000 0.5000 1.0000
66 2 9.0000 3.0000 1.0000
69 2 38.5000 9.5000 7.0000
75 4 12.0000 5.3385 2.0000
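The page does not say how these columns were computed, so the following is only a minimal sketch of one plausible way to derive them, not the tool's actual method. It assumes sentences are already tokenized, that positions are 0-based, and that the standard deviation is the population form. The last two assumptions agree with the rows whose count is 2 (for example, length 63 with avg(pos) 0.5000 and deviation 0.5000). Measuring word length in characters and sentence length in tokens are further assumptions, and the source may use different units. The function name `position_stats_by_length` and the sample sentences are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def position_stats_by_length(sentences):
    """Return {word_length: (count, avg_pos, std_pos, avg_sentence_length)}.

    `sentences` is an iterable of token lists. All definitions here are
    assumptions: 0-based positions, word length in characters,
    sentence length in tokens, population standard deviation.
    """
    positions = defaultdict(list)      # word length -> positions of those words
    sent_lengths = defaultdict(list)   # word length -> lengths of their sentences
    for tokens in sentences:
        for pos, word in enumerate(tokens):
            n = len(word)
            positions[n].append(pos)
            sent_lengths[n].append(len(tokens))

    stats = {}
    for n in sorted(positions):
        pos = positions[n]
        count = len(pos)
        mean = sum(pos) / count
        # Population variance: divide by count, not count - 1.
        variance = sum((p - mean) ** 2 for p in pos) / count
        avg_sentence_length = sum(sent_lengths[n]) / count
        stats[n] = (count, mean, sqrt(variance), avg_sentence_length)
    return stats


# Hypothetical, pre-tokenized sample input (not taken from the corpus).
sample = [["これ", "は", "例", "です"], ["短い", "文"]]
for n, (count, avg, std, avg_len) in position_stats_by_length(sample).items():
    print(f"{n} {count} {avg:.4f} {std:.4f} {avg_len:.4f}")
```

Each printed line follows the same column order as the table above: word length, count, average position, standard deviation of position, and average sentence length.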


Gnuplot diagram

Generated in 60268 msec at 2021-08-22 05:01